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Abstract 

This paper investigates the information-theoretic limits of energy-harvesting (EH) channels in the 
finite blocklength regime. The EH process is characterized by a sequence of i.i.d. random variables with 
finite variances. We use the save-and-transmit strategy proposed by Ozel and Ulukus (2012) together 
with Shannon’s non-asymptotic achievability bound to obtain lower bounds on the achievable rates for 
both additive white Gaussian noise channels and discrete memoryless channels under EH constraints. 
The first-order terms of the lower bounds of the achievable rates are equal to C and the second-order 
(backoff from capacity) terms are proportional to where n denotes the blocklength and C 

denotes the capacity of the EH channel, which is the same as the capacity without the EH constraints. 
The constant of proportionality of the backoff term is found and qualitative interpretations are provided. 
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I. Introduction 

The energy-harvesting (EH) channel consists of one source equipped with an energy buffer, 
and one destination. For simplicity, in this paper, we assume that the buffer has infinite capacity. 
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At each discrete time k G {1,2,. ..}, a random amount of energy E k G [0, oo) arrives at the 
buffer and the source transmits a symbol X k G (—oo, oo) such that 

k k 

< y Ei almost surely. (1) 

t =i ^=i 

This implies that the total harvested energy must be no smaller than the energy of 

the codeword at cvcr Y discrete time k for transmission to take place successfully. We 

assume that {E(}fy l are independent and identically distributed (i.i.d.) non-negative random 
variables, where E[£i] = P and E [Ef] < +oo. The destination receives 

Efc = X k + Z k (2) 


at time slot k for each k G N where {Z k }’jf =1 are i.i.d. standard normal random variables. We 
refer to the above EH channel as the additive white Gaussian noise (AWGN) EH channel. It was 
shown by Ozel and Ulukus [1] that the capacity of the AWGN EH channel is 


C=^log(l + P) 


nats per channel use, 


(3) 


where P = E [Tv j ] is the expectation of the harvested energy for each energy arrival. The AWGN 
EH channel models real-world, practical situations where energy may not be fully available at 
the time of transmission and its unavailability may result in the transmitter not being able to put 
out the desired codeword. This model is applicable in large-scale sensor networks where each 
node is equipped with an EH device that collects a stochastic amount of energy. See [2] for a 
comprehensive review of recent advances in EH wireless communications. 

Observe that the capacity in (3) is unchanged vis-a-vis the AWGN channel without the EH 
constraints. The capacity is an asymptotic concept, defined as the number of channel uses n is 
allowed to tend to infinity. The result of Ozel and Ulukus [1] thus masks the higher-order rate- 
losses that may arise due to the EH constraints. However, in many real-world applications, delay 
or latency constraints are present, e.g., we cannot wait infinitely long to decode the transmitted 
codeword. This begs the natural question: Information-theoretically, and from a finite blocklength 
(non-asymptotic) [3,4] perspective, what are the rate-losses for various channels when we take 
the EH constraints into account? This is what this paper investigates both for the AWGN EH 
channel and its discrete memoryless counterpart. One of the main takeaways from this work 
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is that, compared to when EH constraints are not present (cf. [3]), there can potentially be a 
significant backoff from capacity at moderate blocklengths if one uses the save-and-transmit 
strategy [1] to take the EH constraints in account. 

A. Main Contribution 

The contributions of this paper are threefold: 

1) First, we prove achievable finite blocklength bounds for EH channels under the constraint 
in (1) based on the save-and-transmit strategy of [1], During the saving phase of the 
save-and-transmit strategy, we save energy for a certain number of time slots. During this 
period, no information is transmitted. Subsequently, during the transmission phase, we use 
the remaining time slots to send information. By carefully developing various concentration 
bounds to control the probability that the available energy is insufficient to support the 
transmitted codeword during the transmission phase (i.e., that Ee < Eli**), we 
show that the backoff from capacity C at a blocklength n is no larger than Oi \Jnr } logn). 
In other words, the maximum number of codewords we can transmit over n channel uses 
with average probability of error no larger than e, denoted by M* £ , satisfies 



(4) 


We also identify the implied constant k and provide qualitative interpretations. Such an 
analysis for noisy channels was not available prior to the present work. Furthermore, by 
scrutinizing the analysis of Ozel and Ulukus [1] for AWGN channels, one can also deduce 


that the backoff from capacity is no larger than 0(n 1/2 logn). Thus, our analysis results 


in a slightly smaller (tighter) backoff than what was implied by the authors in [ 1 ], 

2) Second, our analysis only requires minimal statistical assumptions on the EH process 
{EejeLi- Indeed, apart from assuming that the process is i.i.d., we only assume that the 
second moment of the EH random variable Eg is bounded, i.e., 


E [Ef\ < oo, VfeN. 


(5) 


In previous results such as [1, Lemmas 1 & 2], more restrictive assumptions on Ei were 
made, e.g., that E[e E «] is bounded for some 7 e (0,1). This assumption may be hard to 
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verify in practice. 

3) Finally, we study both AWGN and discrete memoryless EH channels. Previous work on 
finite blocklength analysis for EH channels was performed by Yang [5] for noiseless binary 
channels. The analysis required the use of sophisticated martingale convergence theorems 
and renewal process theory. Our analysis is comparatively simple, making use of basic 
probability techniques such as Markov’s and Chebyshev’s inequalities. Furthermore, we 
also analyze finite blocklength fundamental limits of noisy channels. 

B. Related Work 

Information-theoretic characterizations of EH communication channels have been investigated 
recently. As energy arrives randomly to the transmitter, codewords must satisfy the cumulative 
stochastic energy constraints. The impact of the stochastic energy supply on the channel capacity 
was characterized for an additive white Gaussian noise (AWGN) channel with an i.i.d. EH process 
in [1] and with a stationary ergodic EH process in [6]. The aforementioned studies showed that 
with an unlimited battery, the capacity of the AWGN channel with stochastic energy constraints 
is equal to the capacity of the same channel under an average power constraint, as long as the 
average power equals the average recharge rate of the battery. 

Using Shannon’s coding scheme for channels with causal state information at the encoder [7], 
the zero battery case for the same problem was discussed in [8]. Jog and Anatharam [9] 
characterized the capacity of the AWGN channel with a finite battery when energy arrivals 
are deterministic. Dong, Famia and Ozgiir [10] provided an approximation to the capacity with 
bounded guarantee on the approximation gap for i.i.d. Bernoulli energy arrivals. In recent work, 
Shaviv, Nguyen and Ozgiir [11] provided an n-letter expression for the channel capacity with and 
without causal and noncausal energy arrival information at the transmitter and/or the receiver. 
In addition, Shaviv and Ozgiir [12] investigated a similar problem with a Bernoulli recharge 
process. 

Mao and Hassibi [13] investigated the capacity of an energy-harvesting transmitter with finite 
battery over a discrete memoryless channel (DMC). It was shown that the capacity can be 
described using the Verdu-Han general framework [14]. If the transmitted symbol only depends 
on the currently available energy, the system reduced to a finite-state channel. However, it was 
analytically intractable to explicitly characterize the capacity, and even the lower bound of the 
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capacity can only be evaluated numerically. A special scenario of the same problem, namely the 
capacity of noiseless binary channel with binary energy arrivals and unit-capacity battery, was 
discussed in [15]. The channel was shown to be equivalent to an additive geometric-noise timing 
channel with causal information of the noise available at the transmitter. Achievable strategies 
were proposed along with upper bounds, which were then improved in [16]. Ozel et cil. [17] 
considered a noiseless binary energy harvesting channel with on-off fading. 

As mentioned above, finite blocklength analysis for EH channels was only considered previ¬ 
ously by Yang [5]. However, the channel considered therein is noiseless and has binary inputs and 
binary outputs. Our framework is considerably more general and we consider noisy discrete as 
well as Gaussian channels from a finite blocklength perspective. The study of finite blocklength 
fundamental limits in Shannon-theoretic problems was undertaken by Polyanskiy, Poor and 
Verdu [3]. Such a study is useful as it provides guidelines regarding the required backoff from the 
asymptotic fundamental limit (capacity) when one operates at finite blocklengths. For a survey, 
please see [4]. 

C. Paper Outline 

This paper is organized as follows. The notation used in this paper is described in the next 
subsection. Section II states the formulation of the AWGN EH channel and presents our main 
theorem. Numerical results are also provided. Section III describes the save-and-transmit strategy 
and proves our main theorem. More specifically, we use Shannon’s achievability bound [18] to 
prove an achievable rate for the save-and-transmit strategy. Section IV provides and proves an 
analogous result for discrete memoryless EH channels. Concluding remarks are provided in 
Section V. 

D. Notation 

We let 1(E) be the indicator function of the set E. We use the upper case letter X to denote 
an arbitrary (discrete or continuous) random variable with alphabet X, and use a lower case 
letter x to denote a realization of X. We use X n to denote the random tuple (X\,X 2 ,..., X n ). 

The following notations are used for any arbitrary random variables X and Y and any real¬ 
valued function g with domain X. We let px.r and py\x denote the probability distribution of 
(X, Y ) and the conditional probability distribution of Y given X respectively. We let px,y(x, U) 
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and Py\x(d\x) be the evaluations of p X y and p Y \ x respectively at (X. Y ) = (x, y ). To make the 
dependence on the distribution explicit, we let Pr Px {g(X) e A} denote f xeX p x (x)l{g(x) G 
A} dx for any set TCK. The expectation and the variance of g(X) are denoted as E Px [g(X)] 
and Var Px [g(X)\ = E px [(g(X) - E px [g(X)}) 2 ] respectively. We let A f(-] g, a 2 ) : K —>• [0, oo) 
denote the probability density function of a Gaussian random variable whose mean and variance 
are p and a 2 respectively, i.e., 

(6) 

We will take all logarithms to base e throughout this paper. 

II. Additive White Gaussian Noise Energy-Harvesting Channel 
A. AWGN EH Model 

The AWGN EH channel consists of one source and one destination, denoted by s and d 
respectively. Node s transmits information to node d in n time slots as follows. Node s chooses 
message W and sends W to node d, where if' is uniformly distributed over {1, 2,..., M} 
and M = | W|. Then for each k e {1, 2,..., n}, node s transmits X k e E and node d receives 
Y k e M in time slot k. Let Ei, E 2 ,..., E n be i.i.d. random variables that satisfy PrjEd < 0} = 0, 
E[f?x] = P and E [Ef] < oo. We assume the following for each k e {1, 2,..., n}: 

(i) E k and (11' E l ' ~ 1 , X k ~ 1 , Y k ~ 1 ) are independent, i.e., 

Pw,E k ,. Yfe-yyfe- 1 = PE k Pw,E k ~ 1 ,X k ~ 1 ,Y k - 1 (7) 

(ii) Every codeword X n transmitted by s should satisfy 

r k 1 

Pr EE<E Bt f = 1 (8) 

1 1=1 1 =1 J 

for each fc e {1,2 ,..., n}. 

After n time slots, node d declares W to be the transmitted W based on Y" . Formally, we define 
a code as follows: 

Definition 1: An (n, M)-code consists of the following: 

1) A message set W = {1, 2,..., M} at node s. Message W is uniform on W. 
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2) A sequence of encoding functions f k : WxMi —* M for each /c G {1,2,..., n}, where f k is 
the encoding function for node s at time slot k for encoding X k such that X k = f k (W, E k ) 
and (8) holds. 

3) A decoding function ip : —s- W, for decoding W at node d by producing W = p(Y n ). 
Definition 2: The AWGN EH channel is characterized by q Y \x such that the following holds 

for any (n, M)-code: For each k G {1,2,..., rz}. 


P\V,E k ,X k ,Y k ~ Pw,E k ,X k ,Y k ~ 1 PY k \X k 


(9) 


where 

PY k \x k (yk\xk) = qY\x(Vk\xk ) = A f{y k - x k ; 0,1) (10) 


for all x k and y k . Since p Yk \x k does not depend on k by (10), the channel is stationary. 

For any (n, M)-code defined on the AWGN EH channel, let p WEn x „ Y „ lY be the joint 
distribution induced by the code. We can factorize p WEn Y n w as follows: 


(a) 


Pw,E n ,X n ,Y n ,W — PW,E n ,X n ,Y n P\V\ Y n 

= PW Y\.P Ek \ W ’E k - 1 X k ~ 1 ,Y k - 1 Px k ,Y k \W,E k ,X k - 1 ,Y k - 1 ) X Pw\Y™ 


<k= 1 


— PW ( 'WP E kPx k ,Yk\W,E k ,X k - l ,Y k ~ 1 ) Pw\Y n 
= Pw\ ’Y\pE k Px k \W,EkX*- 1 ,Y k - 1 PY k \W,E k ,X*,Y k -' l \Pw\Y'n 


\k =1 
n 


(b) 


(9) 


PW I W_PE k Px k \W,E k PY k \W,E k ,X k ,Y k ~ l ) Pw\ Y n 


\k =1 


PW Y\_P E kPx k \W,E k PY k \X k I Pw\Y «■ 


( 11 ) 


\k =1 


where 

(a) uses the fact by Definition 1 that W is a function of Y n . 

(b) uses the fact by Definition 1 that X k is a function of (W, E k ) for each fc G {1,2,..., n}. 
Definition 3: For an (n, M )-code defined on the AWGN EH channel, we can calculate, 

according to (11), the average probability of decoding error defined as Pr [W f M 7 }. We call 


an (n, M )-code with average probability of decoding error no larger than e an (n. M. e)-code. 

Definition 4: Let e 6 [0,1) be a real number. A rate R is e-achievable for the AWGN EH 
channel if there exists a sequence of (n,M n ,e n )-codes such that 

liin inf — log M n > R, and limsupe n < e. (12) 

n—>oo Tl n_>-oo 

Definition 5: Let e 6 [0,1) be a real number. The e-capacity for the AWGN EH channel, 
denoted by C e , is defined to be C £ = supjf? : R is e-achievable}. 


B. Main Result 


The following theorem is the main result in this paper. The proof is contained in Section III 
after we illustrate the result numerically in Section II-C. 

Theorem 1: Let e 6 (0,1), and define 


= max{E PEi [Ef\,12y/2P 2 } . 


Suppose n > 3 is a sufficiently large integer such that 

E Ml [Ef] 


n 


log?r 


> max 


R2 


,12v / 2 l , 


(13) 


(14) 


n > ( log 


and 


n log n > 


2 + £ 


e°' 4 (2 + e) 


(15) 


(16) 


Then, there exists an (n + m, M, c)-codc such that 


log M > |log(l + P) - \l - 1 


(17) 


where m = 


6y/an log n 
P 


denotes the length of the initial saving period before any transmission 


occurs and n denotes the length of the actual transmission period. In particular, there exists 
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an (n*, M, £)-code with n* = n + m such that 


, „ r n* , , 3 log(l + P)Jan* logn* 

log M > — log(l + P)-^-jf-— 




( 18 ) 


Remark 1: Since -y- Ylk= i Ek converges to E p;;| [P| = P with probability one by the strong 
law of large numbers, it follows from the power constraint ( 8 ) and the strong converse theorem 
for the AWGN channel [19,20] that the e-capacity of the AWGN EH channel is upper bounded 
by \ log(l + P). Therefore, by normalizing both side of (18) by n and taking the limit, we see 
that Theorem 1 implies that the e-capacity is 


C e = ^log(l + P), Ve G [0,1). 


(19) 


Remark 2: The investigation of the save-and-transmit scheme by Ozel and Ulukus in [1, 
Lemma 2] implies that ^ log(l + P) — 0(\/n* (log n*) Q ) nats is achievable over n channel uses 
for any a > 1 and for n —> oc. Theorem 1 improves the lower bound of the second-order term 
because the backoff term improves from — y/n*(logn*) a to — \Jn* log n*. 

Remark 3: It follows from (13) and (18) that the coefficient of the second-order term achieved 
by the save-and-transmit strategy is at least 

—31og(l + P)J max {Ep.JBf], 12x/2f> 2 } 
v i- V ---i-. (20) 

Note that (18) is a direct consequence of (17) and the derivation can be found in the equations 
from (61) to (64) in the proof of the theorem. By inspecting the aforementioned derivation, 
we see that the second-order term in (18) is due to the saving period only, which means the 
second-order term v is affected by the length of the saving period m alone (but not e). As P 
increases, the magnitude of v increases and hence a longer saving period is required to guarantee 
a certain probability of outage , namely that the transmitted codeword does not satisfy all the 
EH constraints. This corroborates the fact that as P increases, the variance of each Gaussian 
codeword increases and hence a longer saving period is required to maintain a certain outage 
probability. Similarly, as E PBi [Ef] increases while P is fixed, the variance of the energy arrival 
process is larger and hence a longer saving period is required to maintain a certain outage 
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Fig. 1. Achievable rates for the save-and-transmit scheme in (21) when the error probability is varied (left) and when the 
variance of the EH process is varied (right). On the plot on the right (Figure 1(b)) for the No-EH line, the peak power (cf. (22)) 
is kept at P = 3 dB 


probability. 

Remark 4: We use Chebyshev’s inequality to obtain the third term in the asymptotic expansion 
in (18), i.e., the one proportional to y/n*. One could also use the Berry-Esseen central limit 
theorem [21, Ch. XVI.5] to obtain a possibly better bound. However, more terms, such as the 
third absolute moments of certain random variables, would be involved. In addition, only the 
coefficient of the third-order term in (18) can be improved slightly by using Berry-Esseen’s 
theorem instead of Chebshev’s inequality, and the improvement is minimal compared with the 
first- and second-order terms in (18). Thus, we have chosen to present a simpler achievability 
bound. 


C. Numerical Results 

In this section, we illustrate achievable rates as a function of n per Theorem 1. We do so for 
E[Ei] = P = 3dB and various values of e and Var[E|] = E [Ef] — P 2 . More specifically, we 
define 

f io g (i + p) - 

£>(EH) A 2 V ; 


P 


6 y/an log n 


( 2 +^ P _ 1 _ ! 
e(P+ 1 ) n 1 


(nats per channel use) to be the non-asymptotic rate achievable by save-and-transmit according 
to (17). We plot R nte against n in Figure 1 for E[Ei] = P = 3dB and various values of e and 
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Var [Ei], corresponding to the lines indicated as “(EH)” respectively. 

In order to demonstrate how much the EH constraints ( 8 ) degrade the non-asymptotic achiev¬ 
able rates compared to the peak power constraint 

Pr {E x '^ nP } = 1 > (22 > 

in Figure 1 we also plot the optimal transmission rate i?, ( f°' EH) under the peak power con¬ 
straint (22), corresponding to the lines indicated by “(No-EH)”. Due to Polyanskiy-Poor-Verdu 
[3, Th. 54, Eq. (294)] and Tan-Tomamichel [22, Th. 1], 

<r H >=c (P )+ yxM 4 - 1(£)+ ^ +0 (i) 

(nats per channel use) where 

a P(P + 2 )(loge) 2 , , 2 , , N 

V(F) =-—-—- (nats per channel use) 

2 (P + l) 2 

is known as the Gaussian dispersion function and <f > _1 is the inverse of the cumulative distribution 
function for the standard Gaussian distribution. We ignore the final correction term in (23) when 
we plot -R^°' EH) because it is negligible compared with the first three terms. 

In Figure 1(a), we see that as £ increases, the backoff of both and from the 

capacity decreases, which is due to the increase of the magnitude of the second term in (21). In 
Figure 1(b), we see that as Var [if i] increases for a fixed E|ifi], the backoff from the capacity 
increases, which is due to the explanation in Remark 3 that a longer saving period is required 
as E [E\] increases. As we can see from Figures 1(a) and 1(b), the performance degradation due 
to the EH constraints and the save-and-transmit strategy compared to the peak power constraint 
is significant. 


(23) 


(24) 


III. Save-and-Transmit Strategy 

In this section, we investigate the save-and-transmit scheme proposed in [1, Sec. IV] in the 
finite blocklength regime. We use this achievability scheme to prove Theorem 1. 
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A. Prerequisites 


The following lemma is useful for obtaining a lower bound on the length of the energy-saving 
phase. The proof is deferred to Appendix A. 

Lemma 1: Let m and n be two natural numbers. Suppose {X k }% =1 and { E f .}™=f are two 
sequences of independent and identically distributed (i.i.d.) random variables such that X n and 
E Tn +n are independent 1 , 

Pr P£i {T; 1 <0} = 0, (25) 

and 

E Ml [Bil=E Ki [At] = P. (26) 


In addition, suppose there exist two positive numbers A and a such that 


max <| E PE] [E 9 {],E PXi 


X?e XX ' 


< a. 


(27) 


If n > 3 is sufficiently large such that 


n fa 1 . 


(28) 


then we have 2 
Pr 


n f k m+k t t / 04 , 

)U 

k —1 v ■v—1 k, — 1 ) ) 


(29) 


Corollary 2: Suppose {X k }^ =1 and {E k }™=™ are two sequences of independent and identi¬ 
cally distributed (i.i.d.) random variables such that X n and E" L+n are independent and also satisfy 
(25) and (26). If X 1 is a Gaussian random variable, then by setting a = max {E PBi [Ej], 12y / 2P 2 j 


'We note that the random variable X n here denotes the intended rather than the actual symbols to be sent, and the intended 
symbols X n are chosen to be independent of the EH process E m+n . More specifically, in the proof of Theorem 1, we construct 
the code such that the last n symbols of the intended codeword to be sent over m + n channel uses (which plays the role of X n 
in Lemma 1) are independent of the EH process E m+n . However, the actual transmitted codeword is a function of (X n , E rn+n ) 
and is correlated with E rn+n so that the EH constraints (8) are satisfied. 

2 The constant e 0A can be further tightened to e losn/ ^ n by inspecting the proof. Since e 0 ' 4 and e logn ^ n differs by a factor of 
at most e 0 ' 4 , we prefer stating the upper bound in terms of e 0,4 to simplify analysis. 
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we have 


m-\-k 


Pr 


PX n P Em+n 


U < 


. k=\ t 7=1 i =1 

for all sufficiently large n that satisfies 


,0.4 


log 71 


^logn-^V^ 


(30) 


U > max l EpE L [E2l] , 12v^ ■ 


log n 


P 2 


Proof: Suppose A"! is a zero-mean Gaussian random variable with variance P. Then, 
straightforward calculation using (6) reveals that 


E 

Lj Px 1 



12V2P 2 . 


(31) 


Fix 



(32) 


and fix an 


» s max IE,,. (£?\E,,. [.YfeW 
^{El].12V2P 2 


(a) 1 

= max < E r , 


(33) 


where (a) follows (31) and (32). It then follows from Lemma 1 that (30) holds for all sufficiently 
large n > 3 such that 


n 

log n 


> max 


(32) 

= max 


(33) 

= max 


(a) 

= max 


a 1 1 

P2’a X 2 ) 

' a 16 P 2 ) 

) 



,12 A 


min 



16P 2 16P 2 | | 

12v^2 P 2 ’E PBi [E 2 ] J J 


( 34 ) 


where (a) follows from the fact that min 


16 P 2 


16 P 2 


12v^P a ’ E P£i [P 2 ] 


< 12 v^. 
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The following lemma [18] is standard for proving achievability results in the finite blocklength 
regime and its proof can be found in [23, Th. 3.8.1]. 


Lemma 3 (Implied by Shannon’s bound [18]): Let px n ,Y n 6c the probability distribution of a 
pair of random variables (X n . Y n ). Let (X"(z). Y n (i)}fi\ be a sequence of independent random 
variables where (A"”(1), Y n (1)) is distributed according to px n ,Y n • 6° r eac h <5 > 0 and each M e 
N, we have 


Pr 


M 


i=2 


U lo s 


p Y ^{Y n {l)\X n (i)) 
Py~(Y»( 1)) 


> log M + n8 > > < e 


—n5 


(35) 


B. Proof of Theorem 1 


Fix an e e (0,1). Define 

a = max |e pBi [E^], 12\/2P 2 |. 


(36) 


Fix a sufficiently large n > 3 such that (14), (15) and (16) hold. Define 

6 'Jan logn 


A 

m — 


P 


(37) 


which specifies the number of time slots which are used for saving energy. Consider the random 
code that uses the channel m + n times as follows: 

Save-and-Transmit Random Codebook Construction 

Let 0 m denote the length-m zero tuple. Let px be the distribution of a zero-mean Gaussian 
random variable X whose variance is P, i.e., 


Px(x) =J\f(x-,0,P). 


(38) 


In addition, let px « be the product distribution of the n independent copies of X. Construct M 
i.i.d. random tuples denoted by X n (l),X n (2),... ,X n (M) such that A n (l) is distributed ac¬ 
cording to px n , where M will be carefully chosen later when we evaluate the probability of 
decoding error. Define 


X m+n (i) 4 (0 m ,x n (i)) 


( 39 ) 
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for each i e { 1 , 2 ,..., M } and construct the random codebook 

{X m+n (i) \i e { 1 , 2 ,...,M}}. (40) 


The codebook is revealed to both the encoder and the decoder. To facilitate discussion, we let 
X k (i) and X k (i) denote the k lh symbols in X n (i ) and X m+n (z) respectively for each T Since 
the first m symbols of each random codeword X m+n (i ) are zeros by (39), the source will just 
transmit 0 with probability one until time slot m+ 1 when the amount of energy E k is 

available for encoding X m+l (Ik) = ) A"!(Ik). 

Encoding under the EH Constraints 

The source has the knowledge of E k before transmitting its symbol in time slot k for each 
k e {1, 2,..., m + n}. For each i e {1,2,..., M}, recalling that X k {i) is the k th element of 
X m+n (i) 1 (0 m , X n (i)), we construct recursively for k — 1, 2,... , rn + n the random variable 

[**(*) if(W(i)) 2 <Es t -E(x,(i,s')) 2 . 

W(i,S t ) = < (41) 

0 otherwise. 


To send message W which is uniformly distributed on {1,2,...,M}, the source transmits 
X k (W, E k ) in time slot k for each k 6 {1,2,... ,m +n}. Note that the source transmits 0 
with probability one in the first m times slots by (39) and (41), and the transmitted codeword 
(X 1 (W,E l ),X 2 (W,E 2 ),.. .,X m+n {W,E m+n )) satisfies the EH constraints ( 8 ) by (41). 


Threshold Decoding 

Upon receiving 






(Ik, E m+n ) + Z m+n 


(42) 


where 


X m+n (W, E m+n ) 4 (X^W, E 1 ),X 2 (W, E 2 ),..., X m+n (W, E m+n )) (43) 

denotes the transmitted tuple specified in (41) and Z m+n is a tuple consisting of i.i.d. standard 
normal random variables by the channel law (cf. ( 10 )), the destination constructs its subtuple 
denoted by Y n by keeping only the last n symbols of Y m+n . Recalling that q Y \x denotes the 
channel law and p x was chosen in the above codebook construction to be the distribution of the 
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zero-mean Gaussian random variable with variance P, we define the joint distribution 

Px,y—PxQy\x, (44) 


and define px n ,Y n to 6c the probability distribution of n independent copies of (. X , Y ) distributed 
according to px,Y, i-e., 

n 

Px^,Y^{x n ,y n ) = Y\_Px,Y(x k ,y k ) (45) 

k= 1 


for all (x n ,y n ) G M 2 . Then, the decoder declares <p(P n ) G {1,2,..., M} (with a slight abuse 
of notation, we write <p(P n ) instead of (p(Y m+n )) to be the transmitted message where ip(Y n ) 
is the decoding function defined as follows: If there exists a unique index j such that 


log 


Pyn\ xn (Y n \X n (j)) 

PY*(Y n ) 


> log M + ny 


(46) 


then p(Y n ) is assigned the value j. Otherwise, : p{Y r ) is assigned a random value uniformly 
distributed on {1,2,..., M}. 

Calculating the Probability of Violating the EH Constraints 

It follows from (43) and (41) that 


Pr X m+n {W, E m+n ) = X m+n (W) 


m-\-n ( k 


fl 


= 1 . 


(47) 


t= i 


fc=i ^ i =i 

Defining X n (W, E m+n ) to be the tuple containing the last n symbols of X m+n (W, E m+n ), we 
obtain from (47) and (39) that 

k m-\-k 


Pr { X n (W, E m+n ) = X n (W) 


fl { 'E(MW))' 2 <Y. E ‘ 


= 1 . 


(48) 


k =1 t 1 =1 Z=1 

Combining Corollary 2, (36) and (14) and noting that E m+n and (IP, X n {W)) are independent 
by construction, we obtain 


m-\-k 


Pr i U > E M 1 ^ 


,fc=i t i=\ 


i= i 


e ° A I 2 log 

3 ° 2 V an 


log n 


(49) 
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which implies that 

p r (n 1 E< A 'f ( w ')) 2 < E E < 


m+k 


,fc=1 I, i=\ 


1=1 



/ g0.4 y 

> 1 - 



\l°g n j 

(37) 

e 0.4 

> 1 - 



n log n 

(16) 

£ 

> 1 - 



2 + e 


D 21 ogn-^V^ 


(50) 


Using (48) and (50), we have 

Pr {X n (W, E m+n ) = X n (W)} > 1-— 

2 + e 


(51) 


Calculating the Probability of Decoding Error 

Defining Z n to be the tuple containing the last n symbols of Z m+n and recalling X n (W, E m+n ) 
and Y n are the tuples containing the last n symbols of X m+n (W, E m+n ) and Y m+n respectively, 
we obtain from (42) and (51) that 


Pr { Y n = X n (W) + Z n }> 1 - 


2 + e ’ 


(52) 


where X n (W) and Z n are independent and Z n consists of i.i.d. standard normal random variables 
by the channel law. Following (46) and (52), we define the events 3 

'p Y "\X"(X n (w)+Z n \X n (i )) 


£,\W = < log 


<logM4-n 4 


P Y n(X n (w) + Z U ) 

and consider the following chain of inequalities for each w e {1,2 ,,M}: 


(53) 


Pr 


= Pr 

Pw (.nr=i PX n (i)')Pz n 

— ^ >Y Pw(I\ , iLlPX™(i))Pz n 
(b) 

< Pr 


1 £w\w U £j\ w 

W = w i 

{ je{l,2,...,M}\{w} 

J 


{f.iiuU 


{%| W = 1} +Pr 


W = 1 


Pw(fI I=1 Px n (i))Pz 


) PJ „{u" 2 £f l i|W'=l} 


— * ‘-PWPx n (l)Pz n 


{£ 1 \ l \W=l} + e 


- ra 1 / 4 


T JL 1. 

J The term n* corresponds to the choice of nS = n 3 in Lemma 3, which can be replaced by any function of the order 
o(^/n log n) without affecting the achievable second-order term in (18) (cf. (55)). We do not optimize the choice of nS because 
it affects only higher-order terms which are negligible compared with the first- and second-order terms. 












18 


= Pr 


Px n (l)Pz n 


(c) n 

= Pr 


YTk=iPx k (i)Pz : 


{£i|i} + e 

{^i|i} + 


-„!/4 


-r,.V4 


(15) 

^ Pl n n k=1 Px kW Pz k l £ i|i} + 


2 + £ ’ 


(54) 


where 


(a) follows from symmetry of the random codebook construction. 

(b) follows from Lemma 3 and (53). 

(c) follows from the fact that that X n (l) and Z" are i.i.d. copies of Xi(l) and Z, respectively 
by construction. 


In order to ensure the first term in (54) can be upper bounded by a simple term, we choose M 
to be the unique integer that satisfies 


log(M + 1) > nE px Y 



f py\x(Y\X) Y 

V PvOO ). 


(2 + e)n 


Var, 


PX,Y 


log 


Py\x(Y\X) 

Py(Y) 


1 

n 4 


> log M. 


(55) 


Following (54), we consider the following chain of inequalities where the random variables are 
distributed according to Uk = iPx k (i)Pz k - 


Prj^ir}^ Pr 


( 55 ) 

< Pr 


(a) 

< 



log 


ELi log 

< nE Px 


PY\x(X k (l) + Z k \X k (l)Y 
PY(X k {l) + Z k ) 

PY\x(X k {l)+Z k \X k (l)) \ 
PY(X k {l)+Z k ) ) 


2 + £ ’ 


< log M + n 4 


log 


Py\x(Y\X)\ 
Py(Y) ) 


(2+e)n 


Var 


PX,Y 


1 og(^) 


(56) 


where (a) follows from Chebyshev’s inequality and the facts based on (44), (38) and (10) that 
(X k (l),X k (l )+Zfc) and (. X , Y ) have the same distribution for each k <E {1,2 ,..., n}. Combining 
(54) and (56), we obtain 


Pr 


1 £w\w U £j |tu 

W = w i 


J 


< 


£ + £ 2 


Pw (n"i PX n (i) )Pz n 


2 + e 


(57) 
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for each w G {1,2,... ,M}. We are ready to compute the probability of decoding error as 
follows, where the random variables are distributed according to Pw,x n {w)Pz™PY+w,x™(w),z™'- 

Pr [p(Y n ) ± W} ( < Pr { {cp(Y n ) ± W} n {Y n = X n (W) + Z n }} + 

< Pr {v(X"(W) + Z") ^W}+^- e 

(a) 

< e (58) 


where (a) follows from the threshold decoding rule (cf. (46) and (53)) and (57). Using (37), 
(40), (55) and (58), we conclude that the constructed code is an (n + m, M, e)-code that satisfies 


log(M + 1) > tiE Px y 



( Py\ x (Y\X) Y 

V Py(Y) J. 


(2 + e)n 


Var, 


PX,Y 




n 4 , 


(59) 


which implies from (44), (38) and (10) that 


n 


log (M + 1) > - log(l + P) - 


(2 + e)nP 

W+T) 


— n 4 , 


which then implies that 


n 


log M > — log(l + P ) 


(2 + e)nP 

<p+ 1 ) 


?z 4 — 1 


(n + m) — 7Ti , 

> 1 ’ -log(l + P) 


(2 + e)(m + n)P 

e{P+ 1) 


(m + n) 4 — 1. 


(60) 


( 61 ) 

(62) 


Equation (17) then follows from (61), (36) and (37). It remains to prove (18). Since m > 0 and 
m < 6 ^ a p log n + 1 by (37), it follows that 


6 \Ja{m + n) log(m + n) 
P 


6 a/ an log n 

+ 1>-—-hi >m, 


(63) 


which implies from (62) that 

, ,, m+n 

log M > - 


log(l + P) - 


3a/c 


(2 + e)(m + n)P 

e(P+ 1 ) 


(m+n) log (m+n) . 

- — -log(l + P) 

(m + n)3 - ^ log(l + P) - 1. 


(64) 
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Equation (18) then follows from (64), (36) and (37) by letting n* = m + n. 


IV. Discrete Memoryless Energy-Harvesting Channel 
A. Channel Model and Main Result 

We now consider a discrete memoryless EH (DM-EH) channel which consists of a finite input 
alphabet denoted by X, a finite output alphabet denoted by a transition matrix q Y \x and a cost 
function c : X —> M + . We assume that X contains a symbol denoted by 0 such that c(0) = 0, 
where 0 represents the idle symbol that consumes no energy. 

As an example of a cost function, let us consider a binary alphabet X = (0,1} and c(x) = x 
for all x G (0,1}. The cost function c characterizes the cost of sending a symbol x G X. For 
a length-?! sequence x n , the total cost is Yl k =i c ( x k)> which is equivalent to the weight of x n 
(number of ones in x n ). Since typically we need to expend some energy to transmit 1 while 
no energy is required to transmit 0 (the transmitter stays “silent”), Y^k= i c ( x k) is a reasonable 
measure of energy consumption for transmitting x n . 

At each discrete time k G {1,2,...}, suppose a random amount of energy E k G [0, oo) arrives 
at the buffer and the source s transmits X k G X such that 


fc-i 


Pr < c(X k ) < E k + £(£,-c(X,)) I =Prt £>(*,)< 


k 

£ 

e =i 


Ef > — 1. 


(65) 


t= i ) t, i=\ 

We assume that Ei, E 2 ,... are independent and identically distributed (i.i.d.) non-negative ran¬ 
dom variables, where E [Ei] = P and E [Ef] < +oo. The destination d receives Y k from the 
channel output in time slot k for each k G {1,2,.. .}, where p Yk \x k is distributed according to 
the channel law such that p Yk \x k (yk\xk) = qy\x(yk\x k ) for all (x k ,y k ) e X x After n time 
slots, node d declares W to be the transmitted W based on Y". We formally define a code for 
the DM-EH channel as follows. 

Definition 6: An (n, M)-code consists of the following: 

1) A message set VV = (1, 2,..., M} at node s. Message W is uniform on W. 

2) A sequence of encoding functions f k :Wx —» X such that X k = f k (W, E k ) and (65) 

holds. 

3) A decoding function p : y n —> W, where p is the decoding function for W at node d 
such that W = ip(Y n ). 
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Definition 7: The discrete memoryless EH (DM-EH) channel is characterized by q Y \ x suc h 
that the following holds for any (n, M)-c ode: For each k & { 1,2,..., n}, 

Pw,E k ,X k ,Y k = Pw.E k ,X k .Y k -'PY k \X k ( 66 ) 

where 

Pv k \x k (Vk\xk) = QY\x(yk\xk ) (67) 

for all (. XkiVk ) G Tf x Since Pv k \x k does not depend on k by (67), the channel is stationary. 

For any (n, M)-code defined on the DM-EH channel, let p WEn x „ Y „ w be the joint distribution 
induced by the code. Similar to (11), we can factorize p WEn X n yn w as 

Pw,E n ,X n ,Y n ,W = PW (n P E kPx k \W,E k pY k \X k \ P\V\ Y n, ( 68 ) 

which implies through straightforward calculations that 

n 

PW,E n ,X n ,Y n = PW,E n ,X n \\_PY k \X k - (69) 

k= 1 

Definition 8: For an (n, M)-c ode defined on the DM-EH channel, we can calculate according 
to ( 68 ) the average probability of decoding error defined as Pr {W 7 -W}. We call an (n, M)- 
code with average probability of decoding error no larger than e an (n,M,e)- code. 


For each e e [0,1), we define the c-achievable rate and the e-capacity in the same way as done 
in Definition 4 and Definition 5. Define the capacity-cost function 


C (P) = 


max 


E. 


p x :E PX [c(X)]=P 


pxqy\x 


log f 


Qy\x(Y\X) 


V Exe* Px{x)q Y \x(Y\x) J \ Px .e px [c(x)]=p 


max 


PxQy\x 


{X-Y). 

(70) 


Note that this is the capacity of the DMC when each codeword X n has to satisfy the cost 
constraint c (^f) < tiP. Similarly, define 


V(P) = max Var. 

p x :E Px [c{X)]=P 1 


PxQy\x 


log 


q Y \x{Y\X) 


T,xexPx(x)q Y \x(Y\x) 


(71) 


We also note that this is the maximum variance of the log-likelihood ratio of the channel and 
the output distribution where the input distribution is constrained to be such that the same cost 
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constraint is satisfied. The following theorem is the main result in this section. 
Theorem 2: Let e e (0,1), and define 


a = max 


{ E p Si l E fl max { {c(x)) 2 e c{x) \x e X}} 


For each sufficiently large integer n > 3 such that 


n > a 


log n P 2 ’ 

n — f lQ g (^T 


and 


n log n > 


e 0A (2 + e) 


there exists an (n*, M, e)-code with n* = n + 


6y/an log n 
P 


such that 


log M > n*C(P) - 


6C (P)y/an* log?r* (2 + e)n*X(P) 


P 


(n*)* -C(P) - 1. 


(72) 


(73) 

(74) 


(75) 


(76) 


Remark 5: Since E i E k converges to E P;;| \E\ = P with probability one by the strong 
law of large numbers, it follows from the power constraint (65) and the strong converse theorem 
for the DMC with cost constraint [20,24] that the e-capacity of the DM-EH channel is upper 
bounded by C (P), defined in (70). Therefore, Theorem 2 shows that the e-capacity of the DM- 
EH channel is C(P). In addition, Theorem 2 implies a lower bound on the second-order term 
that is proportional to — y/n*\ogn*. 

Remark 6: We observe that a > E [Ef\ > E[Pi] 2 = P 2 . Thus as P increases, the magnitude 
of the coefficient of the second-order term (that scales as \Jn* log n* and depends on the length 
of the saving period alone due to similar reasons provided for the Gaussian case in Remark 3) 
increases with P, which implies that a longer saving period is needed to maintain a fixed 
probability of outage (the term “outage” is as described in Remark 3). Similarly, as E[P 2 ] 
increases while P is fixed, the variance of the energy arrival process increases and hence a 
longer saving period is needed to maintain a certain fixed outage probability. 
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B. Save-and-Transmit Strategy 


Before we provide the proof of Theorem 2, we state the following useful lemma that is 
analogous to Lemma 1. 

Lemma 4: Let m and n be two natural numbers. Suppose {X k }^ =1 and {E k }™kff are two 
sequences of independent and identically distributed (i.i.d.) random variables such that X n and 
E m+n are independent, 

Pt PEi {E 1 <0} = 0, (77) 

and 

[-^i] = Epx, [c(A^i)] = P. (78) 

In addition, suppose there exists an a > 0 such that 

max {E PBi [Ef], max { (c(x)) 2 e c ^ | x 6 f}}<a. (79) 


If n > 3 is sufficiently large such that 


n 


a 

> — 


logn P 2 ’ 


( 80 ) 


then we have 


m+k 


Pr 


PX n P Em+n 




,fe=i k t=i 


i=i 


.0.4 


logn 


=2 log 


( 81 ) 


Proof: If we replace every instance of X\ with c(X k ) for each k E {1,2,... ,n} and set 
A = 1 in the proof of Lemma 1 in Appendix A, then the resultant proof will immediately lead 
to this lemma. ■ 

We are ready to prove Theorem 2 as follows: 

Proof of Theorem 2: Fix an £ E (0,1). Define 


a = max (E PSi [Ef\, max { (c(x)) 2 e c ^| x E A”}} . 


(82) 


Fix a sufficiently large n > 3 such that (73), (74) and (75) hold. Define 

6 yj an logn 


A 

m = 


P 


(83) 
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which specifies the number of time slots which are used for saving energy. Consider the random 
code that uses the channel m + n times as follows: 

Save-and-Transmit Random Codebook Construction 

Let 0 m denote the length-m zero tuple. Let p x be an arbitrary distribution on X such that 

V PX [c(X)] = P. (84) 


In addition, let p X n be the distribution of the n independent copies of A". Construct M i.i.d. 
random tuples denoted by A n (l), A n (2),..., X"(M) such that A n (l) is distributed according 
to px n , where M will be carefully chosen later when we evaluate the probability of decoding 
error. Define 

I m+n (i)4(0 m ,F(i)) (85) 


for each i G (1,2,..., M} and construct the random codebook 

{X m+n (i) | i G (1, 2,..., M}}. (86) 


The codebook is revealed to both the encoder and the decoder. Since the first m symbols of each 
random codeword X m+n (i) are zeros by (85), the source will just transmit 0 with probability 
one until time slot m + 1 when the amount of energy j 1 E/,. is available for encoding 

X m+1 (W) i = ) X 1 (W). 

Encoding under the EH Constraints 

For each i G (1,2let X k(i) be the k th element of X m+n [i) < = > (0 m , X n (i)) and 
construct recursively for k = 1, 2 ,..., m + n the random variable 


X 


kV"i 


E k ) 4 



if c(A fe (i))<Etx^ 

otherwise. 


1 


c(X e (i,E% 


(87) 


To send message W which is uniformly distributed on {1,2the source transmits 
Xk(W,E k ) in time slot k for each k G {1,2, ...,m + n}. Note that the source transmits 0 
with probability one in the first m times slots by (85) and (87), and the transmitted codeword 
(Xi(W, E 1 ), X 2 (W, E 2 ),..., X m+n (W, E m+n )) satisfies the EH constraints (65) by (87). 

Threshold Decoding 
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Upon receiving Y m+n which is generated according to 


. „ / m+n m+n m+n\ 

m+n 

= Pgra+n ^ Y m +™(W,i? m + n ) ( e ,W,X ) | (Z/fc 


( 88 ) 


fc=i 


where 


X m+n (W, E m+n ) 4 E 1 ), X 2 (W, E 2 ),..., X m+n (W, E m+n )) 


(89) 


denotes the transmitted tuple specified in (87), the destination constructs its subtuple denoted by 
Y" by keeping only the last n symbols of Y m+n . Recalling that q Y \x denotes the channel law 
and p x was chosen to satisfy (84), we define the joint distribution 

Px,y—PxQy\x, (90) 


and define px n ,Y n t0 0c the probability distribution of n independent copies of (A", Y) distributed 
according to px.Y, i-e., 

n 

Px",Y"{x n ,y n ) = Y[px,Y( X k,Hk) (91) 

k= 1 


for all ( x n ,y n ) G M 2 . Then, the decoder declares p(Y n ) G {1,2,... ,M} (we write p(Y n ) 
instead of ip(Y m+n ) to simplify notation) to be the transmitted message where is the 

decoding function defined as follows: If there exists a unique index j such that 


log 


p Y ^{Y n \X n {j)) 

PY n (Y n ) 


> log M + n*, 


(92) 


then ip(Y n ) is assigned the value j. Otherwise, +{Y r ) is assigned a random value uniformly 
distributed on {1,2 ,... ,M}. 

Calculating the Probability of Violating the EH Constraints 

Define X n (W,E m+n ) to be the tuple containing the last n symbols of X m+n (W, E m+n ). Fol¬ 
lowing similar procedures for proving (51), we can obtain from (87), (85), Lemma 4, (82), (73), 
(83) and (75) that 

Pr [X n (W, E m+n ) = X n (W)} > 1-—. (93) 

2 ~\~ s 
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Calculating the Probability of Decoding Error 

Choose M to be the unique integer that satisfies 


log(M + 1) > nE PXY 



( Py\ x (Y\X) Y 

V MY) J. 


(2 + e)n 


Var, 


PX,Y 


log 


Py\x(Y\X) 

MY) 


1 

n 4 


> log M. 


(94) 


Following similar proof steps from equation (52) to equation (59) for showing the existence 
of an [n + m,M,e )-code for the AWGN EH channel that satisfies (59), we can show that the 
constructed code is an (n + m, M,e)-c ode that satisfies 


log (M + 1) > nE pxY 



f p Yl x(Y\X) Y 

V MY) J. 


(2 + e)n 


Var, 


PX,Y 




— n 4 , 


which implies from (90), (84), (67) and (71) that 


log (M + 1) > nE px Y 



f py\x{Y\X) Y 

V MY) J. 


(2 + s)nV(P) 


— n 4 , 


(95) 


which then implies that 


1 + log M > nE, 


PX,Y 


log 


p y | X (V|X) \l _ / (2 + g)nV(P) 

MY) J V £ 


> ((m+n)—m) E, 


MY) J 

fpy lX (Y\X) 


n 4 


PX,Y 


log 


MY) 


(2 + e)(m + n)V(P) 


■ (m + n) 4 . 

(96) 


Since m > 0 and m < 6 ^ a p logn 


+ 1 by (83), it follows that 


6\/a(m + n) log(m + n) &^Jan\ogn 

+ 1 > -—-1-1 > m, 


P 


P 


(97) 


which implies from (96) and (70) that 


log M > (m + n) E, 


PX,Y 


log 


Pr(Y) ) 


6C (P)v^ (m + n) log(m + n) 


P 


(2 + e)(m + n)V(P) 


•(m + n)4 — C(P)— 1. 


( 98 ) 


The theorem then follows from (98), (82), (83), (84) and (70) by letting n* = m + n. 
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V. Conclusion and Future Work 

This paper has provided the first systematic study of finite biocklength achievabie rates over 
noisy EH channels. We observe that the backoff from capacity at a finite biocklength n is of the 
order logn). After the present work was submitted and placed on the arXiv, Shenoy 

and Sharma [25] used Kolmogorov’s inequality (a consequence of Doob’s maximal inequality) 
to bound the probability that the maximum (in time) of the difference between the harvested 
energy and the consumed energy (a martingale) exceeds a certain positive constant. They showed 
that the backoff term can be improved to 0(n _1//2 (logn) a ) for any a > 0. In fact, by combining 
our arguments and theirs, it is not difficult to further improve this backoff term to 0(n~ l/2 ). 
This 0(n -1//2 ) scaling (rate of growth) in the second-order term is clearly optimal [3,26]. At the 
same time, the development of new strategies for nailing down the constant is clearly a fertile 
avenue for future research. However, taking the n EH constraints in (1) into account to obtain a 
tighter bound from the meta-converse or its relaxed versions [3, Sec. III-E and III-F] (compared 
to the usual normal approximation) does not seem to be straightforward. 

Appendix 

Proof of Lemma 1 

We start with the following basic fact. 

Proposition 5: For each non-negative real number x, 

l+x<e x <l + x + x 2 e x /2 (99) 

and 

1 - x < e“* < 1 - x + x 2 /2 . (100) 

Proof: For any real number u G M, it follows from Taylor’s theorem that 

e u = 1 + u + u 2 e c 12 (101) 

where c is some number between 0 and u. Inequalities (99) and (100) follow from (101). ■ 

We are now ready to prove Lemma 1. 
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Proof of Lemma 1: Fix a sufficiently large n that satisfies (28) and fix a k e {1,2,.... n}. 
To simplify notation, define 

. lncn 

( 102 ) 


a logn 

In = 


an 


where a satisfies (27). Consider the following chain of inequalities, where subscripts of proba¬ 
bility and expectation terms are omitted for simplicity: 

( k m-\-k \ 

£X? > £E, 1 = p r{e^CV-ES**<) > l} 


r=i 


1= 1 


(a) 

< E 


= (E 


= V^(Eti^ 2 -Er= + i fe ^) 




(E [e-^]) m+fe 

< (l + Vt» e [A'?] + ) k (l - x/t^E [EJ + ^E[E?]) 

< (l + v^E [A?] + ^E [Afe^])‘ (l - Vt2E [E,] + ^E[E?]) 


ra+fc 


m+/c 


(f) / fco , a-v„ x fc / , a-v„ \ m + fc 

< ( g\/7n-P+ - 




= g^7n ( p -V7^P+^ 


< e fca 7n -^K 


(103) 


where 

(a) follows from Markov’s inequality. 

(b) follows from the facts that {X k }% =1 and { E k }f=f are two sequences of i.i.d random 
variables and X n and E m+n are independent. 

(c) follows from the upper bounds in Proposition 5. 

(d) uses the following fact due to (102) and (28): < A. 

(e) follows from (26) and (27). 

(f) follows from the lower bounds in Proposition 5. 

(g) follows from the fact that 
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(104) 



















29 


Consequently, it follows from the union bound that 
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( 103 ) 
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where (a) follows from the fact that for n > 3, cry™ * — ’ < 0.4. 


(105) 
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